The Trouble with Sliding Windows and the Selective Pressure in BRCA1
نویسندگان
چکیده
Sliding-window analysis has widely been used to uncover synonymous (silent, d(S)) and nonsynonymous (replacement, d(N)) rate variation along the protein sequence and to detect regions of a protein under selective constraint (indicated by d(N)d(S)). The approach compares two or more protein-coding genes and plots estimates d(/\)(S) and d(/\)(N) from each sliding window along the sequence. Here we demonstrate that the approach produces artifactual trends of synonymous and nonsynonymous rate variation, with greater variation in d(/\)(S) than in d(/\)(N). Such trends are generated even if the true d(S) and d(N) are constant along the whole protein and different codons are evolving independently. Many published tests of negative and positive selection using sliding windows that we have examined appear to be invalid because they fail to correct for multiple testing. Instead, likelihood ratio tests provide a more rigorous framework for detecting signals of natural selection affecting protein evolution. We demonstrate that a previous finding that a particular region of the BRCA1 gene experienced a synonymous rate reduction driven by purifying selection is likely an artifact of the sliding window analysis. We evaluate various sliding-window analyses in molecular evolution, population genetics, and comparative genomics, and argue that the approach is not generally valid if it is not known a priori that a trend exists and if no correction for multiple testing is applied.
منابع مشابه
A Novel Ensemble Approach for Anomaly Detection in Wireless Sensor Networks Using Time-overlapped Sliding Windows
One of the most important issues concerning the sensor data in the Wireless Sensor Networks (WSNs) is the unexpected data which are acquired from the sensors. Today, there are numerous approaches for detecting anomalies in the WSNs, most of which are based on machine learning methods. In this research, we present a heuristic method based on the concept of “ensemble of classifiers” of data minin...
متن کاملMining Frequent Patterns in Uncertain and Relational Data Streams using the Landmark Windows
Todays, in many modern applications, we search for frequent and repeating patterns in the analyzed data sets. In this search, we look for patterns that frequently appear in data set and mark them as frequent patterns to enable users to make decisions based on these discoveries. Most algorithms presented in the context of data stream mining and frequent pattern detection, work either on uncertai...
متن کاملComparison of BRCA1 Expression between Triple-Negative and Luminal Breast Tumors
Background: Previous studies have suggested that BRCA1 dysregulation has been shown to have a role in triple-negative phenotypic manifestation. However, differences of BRCA1 expression, as a tumor suppressor gene, have rarely been investigated between luminal and triple-negative breast tumors. Therefore, the present study attempted to compare the BRCA1 expression in triple-negative with lu...
متن کاملEffects of Using Lipsticks with any Lead Content on the BRCA1 Gene Mutations
Introduction: Breast cancer is one of the leading causes of cancer death in women. Variations in the BRCA1, BRCA2, CDH1, STK11 and TP53 genes increase the risk of developing breast cancer. In addition to specific genetic changes, environmental factors may influence an individual’s risk of developing breast cancer. Lead is one of the most dangerous chemicals in the air as well as many products,...
متن کاملFunctional investigation of the BRCA1 Val1714Gly and Asp1733Gly variants by computational tools and yeast transcription activation assay
Mutations in the BRCA1 gene are known to be a major cause of hereditary breast cancer. However, characterizing the point mutationsassociated with cancer in BRCA1 is challenging because the functional impact of most of them is still unknown. Nowadays, a variety of methods are employed to identify cancer-associated mutations in BRCA1. This study is aimed to ass...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PLoS ONE
دوره 3 شماره
صفحات -
تاریخ انتشار 2008